FIELD OF THE INVENTION

The present invention relates to processing of information signals and, more
particularly, to processing of audio signals.

BACKGROUND OF THE INVENTION

The introduction of new systems like DVB and DVD has brought digital 
multi-channel sound within reach of a large group of users. The majority of users will 
however stay for a long time with stereo sound reproduction. 

One solution for serving consumers with both 2-channel and multi-channel
equipment is so-called simulcast. In this case, two separate information signals are
transmitted in parallel, one containing a representation of the multi-channel sound and one
containing a representation of the 2-channel sound. To achieve an economical use of
transmission or storage capacity, audio bit rate reduction will be used in most applications.
The transmitted or stored information signal will then be in the form of a coded bit stream,
which requires a decoder to retrieve the audio signal to be reproduced. Nevertheless,
simulcast is an expensive solution in terms of required transmission or storage
capacity, which makes it unacceptable in most practical situations.

Another solution is to transmit only the multi-channel information signal,
directly serving the consumers with multi-channel sound reproduction equipment. The
2-channel users then need a decoder that consists of a multi-channel decoder followed by a
downmix module that creates a downmix from multi-channel to 2-channel. Such a 2-channel
decoder is thus more complex than a regular multi-channel decoder. In this case, the
2-channel users (the majority) have to pay for the multi-channel capability of others.

It is undesirable that those users are burdened by the multi-channel audio
capabilities of a system, in the form of higher costs or higher power consumption. It is also
undesirable to waste bandwidth of the system by simulcast, i.e. storage and transmission of
both a 2-channel (stereo) and a multi-channel stream.

An encoding system that allows a single coded multi-channel audio stream to
be decoded by both a true stereo decoder and a multi-channel decoder is the MPEG-2 audio
backwards compatible multi-channel coder (MPEG-2 BC). In all other coding systems, the
stereo decoder is basically an (expensive) multi-channel decoder followed by a down-mix
to stereo.

The MPEG-2 BC coder achieves this by performing at the encoder side a
down-mix from e.g. 5-channel sound to stereo, coding this as a pure stereo stream, and
encoding as an extension three properly chosen signals out of the five input signals. The
stereo decoder only decodes the pure stereo stream. A multi-channel decoder also decodes
the extra information, and uses an inverse matrix to retrieve the original 5 channels from the
down-mix and the additional three channels. This inverse matrix is encoded as side
information in the coded bitstream.

US-B1-6 275 589 describes MPEG-2 having backwards compatibility with
MPEG-1, whereby the signals of multi-channel sound channels are matrixed. Stereo signals
calculated in this process are then transmitted as an MPEG-1-compatible stereo signal and
the remaining audio signals are transmitted as supplementary data. This method is known as
"compatibility matrixing".

In "Compatibility Matrixing of Multi-Channel Bit Rate Reduced Audio
Signals" by ten Kate, preprint 3792, 96th AES Convention, 1994, February 26 - March 01,
Amsterdam, it is recognised that the MPEG-2 BC system does not work optimally when
one of the signals in the multi-channel configuration is down-mixed to both the left and
right channel of the stereo downmix. This is specifically the case for the Centre channel or
for a monophonic Surround channel. The first situation is commonly referred to as the
"Dominant Centre" situation.



SUMMARY OF THE INVENTION 

It is an object of the invention to provide improved encoding of multiple input
signals employing compatibility matrixing. To this end, the invention provides a method for
encoding, a method for decoding, an apparatus for encoding, an apparatus for decoding, a
signal format and a record carrier as defined in the independent claims. Advantageous
embodiments are defined in the dependent claims.

According to a first aspect of the invention, the object is realized by encoding
N input signals, with N>2, said encoding comprising:

- generating from the N input signals a composition of M signals, with N>M≥2,

- encoding the composition of M signals into coded data,

- encoding a selection of N-M out of the N input signals into coded data,

wherein the composition of M signals is orthogonalized prior to encoding.

Preferably, orthogonalization is done by switching between independent
coding and sum/difference coding. For example, sum/difference coding of the
compatible signal, i.e. the composition of M signals, is used in case of a dominant center
situation or a dominant surround situation, and independent coding is used in other situations.

In an embodiment of the invention, the encoder includes a control signal in the
encoded signal to indicate to the decoder how the orthogonalizing has been performed and
consequently how the de-orthogonalizing should be performed.

Preferably, M=2.

Preferably, orthogonalization is done in the frequency domain.

Preferably, switching between independent coding and sum/difference coding 
can be selected per frequency band. 

These and other aspects and embodiments of the invention will be apparent
from the preferred embodiment(s) described hereinafter.

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will be more clearly understood from the following
description of the preferred embodiments of the invention read in conjunction with the
attached drawings, in which:

Fig. 1 illustrates a block diagram of a system in which the present invention is
implemented;

Fig. 2 illustrates a signal going out from an encoder; and

Fig. 3 illustrates a flow diagram for a method according to a preferred
embodiment of the invention.

DETAILED DESCRIPTION OF THE INVENTION 

Fig. 1 illustrates an overall block diagram of a system 10 in which the present
invention is implemented. The system 10 comprises a matrix 1 including downmixing and
selection of N-M signals from the N input signals, an encoder 2 including a stereo encoder 2a
and a surround extension encoder 2b, a multiplexer/formatter unit 3, a decoder 4 including a
stereo decoder 4a and a surround extension decoder 4b, an inverse matrix 5 and a switching
unit 15 for switching the coding carried out in the encoder 2a between at least two coding
modes. The system 10 illustrated in Fig. 1 is a multi-channel encoder/multi-channel
decoder system having down-mix in the encoder.

N input channels, e.g. a left channel L, a right channel R, a centre signal C, a
left surround signal LS and a right surround signal RS, are first transmitted to the matrix 1
and further to the encoder 2 comprising the stereo encoder 2a and the surround extension
encoder 2b. The stereo encoder 2a encodes a composition of M=2 signals, e.g.
L0 = L + C + LS and R0 = R + C + RS. The stereo encoder 2a further comprises an
orthogonalizing unit 12 which, together with switching unit 15, orthogonalizes the
composition of the M=2 signals, e.g. by
performing a switch to sum/difference coding of L0 and R0 in the case of dominant center or
dominant surround. The orthogonalizing unit 12 further provides a control signal to indicate
to the decoder how the orthogonalizing has been performed and consequently how the
de-orthogonalizing should be performed. The encoding preferably is a so-called "perceptual
audio encoding", whereby each of a succession of time domain blocks of an audio signal is
coded in the frequency domain. Specifically, the frequency domain representation of each
block is divided into bands, each of which is coded based on psycho-acoustic criteria, so that
the audio signal is compressed efficiently. Other types of coding schemes are also possible,
but are not further described in this example.

The encoded signal is multiplexed/formatted in the multiplexer/formatter unit
3 and transmitted as a signal Qout to the decoder 4 as a composition of M signals in a first
bit-stream and a selection of N-M signals in a second bit-stream (illustrated as two arrows
going into the decoder 4). The signal Qout is illustrated in Fig. 2, which shows the two
bit-streams placed "onto" each other. Each bit-stream comprises a header 7 and data fields 8
and/or 9. The control signal indicating how the orthogonalizing has been performed may be
included in a header 7 of the first and/or the second bit-stream.

Alternatively, the coded data representing the orthogonalized composition of 
M signals and the coded data representing the selection of N-M signals are included in the 
same bit-stream, e.g. in the data fields 8 and 9 respectively. The control signal, indicating 
how the orthogonalizing has been performed, may then be included in the header 7. 

The decoder 4 comprises a stereo decoder 4a and a surround extension decoder
4b. Matrix 5 derives the original 5 channels from the decoded stereo stream and the
additional decoded three channels. Matrix 5 performs an operation which is inverse or
substantially inverse to the operation performed in matrix 1. The stereo decoder 4a further
comprises a de-orthogonalizing unit 14 which de-orthogonalizes the composition of the M=2
signals after decoding, e.g. by switching to sum/difference decoding or independent
decoding in dependence on the control signal indicating to the decoder how the
orthogonalizing has been performed and consequently how the de-orthogonalizing should be
performed. This control signal, which originates from unit 12, is included in the coded data
stream.
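
The pairing of the orthogonalizing unit 12 and the de-orthogonalizing unit 14 can be sketched as follows. This is a minimal illustration in Python, not the implementation of the embodiment: the names `CodingMode`, `orthogonalize` and `deorthogonalize` are hypothetical, the control signal is modelled as a simple enumeration, and gain factors are omitted.

```python
from enum import Enum


class CodingMode(Enum):
    """Hypothetical stand-in for the control signal sent to the decoder."""
    INDEPENDENT = 0
    SUM_DIFFERENCE = 1


def orthogonalize(l0, r0, mode):
    """Encoder side (unit 12): map (L0, R0) to the pair that is actually coded."""
    if mode is CodingMode.SUM_DIFFERENCE:
        return l0 + r0, l0 - r0  # sum and difference signals
    return l0, r0


def deorthogonalize(a, b, mode):
    """Decoder side (unit 14): recover (L0, R0) according to the control signal."""
    if mode is CodingMode.SUM_DIFFERENCE:
        return (a + b) / 2, (a - b) / 2
    return a, b
```

In either mode, applying `deorthogonalize` to the output of `orthogonalize` with the same mode returns the original pair, which is why the decoder only needs the control signal to undo the encoder's choice.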

Fig. 3 illustrates a flow diagram of a method according to a preferred
embodiment for encoding the N input signals. In a first step 101, the N input signals are
transformed to a frequency domain representation prior to encoding. In a second step 102, it
is determined whether a dominant center situation or dominant surround situation occurs
(indicated Y) or not (indicated N). If Y, then the sum/difference coding mode (step 103) is
selected. If N, then the signals are independently coded. The actual coding takes place in step
104. In step 104, the composition of M signals is coded into a bit stream of data, typically a
first bit-stream, and a selection of N-M out of the N input signals is coded into another
bit-stream of data, typically a second bit-stream. Steps 102 and 103 together are also
referred to as the orthogonalization step.
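
Steps 102 and 103 can be sketched per frequency band as below. The text does not specify how a dominant center or dominant surround situation is detected, so the energy test used here, the function names `choose_mode` and `choose_modes`, and the threshold `ratio` are assumptions made purely for illustration.

```python
def choose_mode(l0_band, r0_band, ratio=0.1):
    """Return 'sum_difference' or 'independent' for one frequency band.

    Assumed heuristic (not taken from the text): when L0 and R0 are nearly
    equal (dominant center) the difference energy is tiny, and when they are
    nearly in anti-phase (dominant surround) the sum energy is tiny.
    """
    e_sum = sum((a + b) ** 2 for a, b in zip(l0_band, r0_band))
    e_diff = sum((a - b) ** 2 for a, b in zip(l0_band, r0_band))
    small, large = sorted((e_sum, e_diff))
    if large > 0 and small <= ratio * large:
        return "sum_difference"
    return "independent"


def choose_modes(l0_bands, r0_bands, ratio=0.1):
    """Apply the decision independently to each frequency band."""
    return [choose_mode(l, r, ratio) for l, r in zip(l0_bands, r0_bands)]
```

Returning one decision per band mirrors the option of selecting the coding mode per frequency band; a single decision per block would be the simpler variant.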

As is clear to the skilled person, the decoding operation is inverse or 
substantially inverse to the encoding operation. 

Examples of matrix equations will be described below to explain embodiments
of the invention better. The matrix equations 1-21 describe a situation where the present
invention is not applied. These equations are shown to describe the encoding and decoding
before describing the equations of a preferred embodiment of the invention, for a better
understanding of the invention.

Example matrix equations are the following (gain factors are omitted for
clarity):

At the encoder side:
L0 = L + C + LS (1)
R0 = R + C + RS (2)
T3 = C (3)
T4 = LS (4)
T5 = RS (5)
where the transmission channels are: L0, R0, T3, T4 and T5.

At the decoder side:
C' = T3' (6)
LS' = T4' (7)
RS' = T5' (8)
L' = L0' - C' - LS' = L0' - T3' - T4' (9)
R' = R0' - C' - RS' = R0' - T3' - T5' (10)
where the sign ' denotes a decoded signal.

Although the matrix inversion at the decoder side is exact, the equations above
do not yield exactly the original input signals, because the transmission channels L0, R0, T3,
T4 and T5 are altered by the encoding.
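
Equations (1)-(10) can be checked with a short sketch. The function names and the way a coding error is injected (a fixed offset added to L0) are illustrative assumptions; a real perceptual coder alters the channels in a signal-dependent way.

```python
def encode_matrix(L, R, C, LS, RS):
    """Equations (1)-(5): returns the transmission channels L0, R0, T3, T4, T5."""
    return L + C + LS, R + C + RS, C, LS, RS


def decode_matrix(L0, R0, T3, T4, T5):
    """Equations (6)-(10): returns the decoded L', R', C', LS', RS'."""
    return L0 - T3 - T4, R0 - T3 - T5, T3, T4, T5


original = (3, 4, 10, 1, 2)  # L, R, C, LS, RS as arbitrary integer samples
channels = encode_matrix(*original)

# Without coding error the inverse matrix is exact.
lossless = decode_matrix(*channels)

# A hypothetical coding error of +1 on L0 reappears unchanged in L'.
L0, R0, T3, T4, T5 = channels
noisy = decode_matrix(L0 + 1, R0, T3, T4, T5)
```

Here `lossless` equals `original`, while `noisy` differs from it only in its first element, the decoded left channel.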

The coding of T3, T4 and T5 is directly controlled by the perceptual encoder
and consequently C', LS' and RS' will not give rise to quality problems. In the example
presented above, due to the matrixing, the coding noise in L0, T3 and T4 will appear in L',
and the coding noise in R0, T3 and T5 will appear in R'. This coding noise could be
minimized by choosing appropriate extra channels to be transmitted with L0 and R0. If C, LS
and RS are the weakest signals, then the coding noise in L' and R' will be dominated by L0'
and R0', respectively, which is again directly controlled by the perceptual encoder. If another
signal combination is the weakest, this signal combination should be chosen to be transmitted
as T3, T4 and T5.

However, when the center signal C is the strongest signal (in the following
referred to as the "dominant center" situation), L0 is almost equal to R0.

It can be shown that one of the small signals then always needs to be retrieved by
subtracting two large, almost equal signals. This can be represented by
the following formulas:



At the encoder side:
L0 = L + C + LS (11)
R0 = R + C + RS (12)
T3 = L (13)
T4 = LS (14)
T5 = RS (15)

At the decoder side:
L' = T3' (16)
LS' = T4' (17)
RS' = T5' (18)
C' = L0' - L' - LS' = L0' - T3' - T4' (19)
R' = R0' - C' - RS' = R0' - C' - T5' (20)
   = R0' - L0' + T3' + T4' - T5' (21)

where R' is small, R0' and L0' are both large and T3', T4' and T5' are all small. It is clear
that a relatively small error in L0 or R0 will lead to a relatively large and clearly audible error
in the resulting signal R'. The quality could be maintained, but only by coding at least one of
the compatible signals L0, R0 at a much higher bit-rate than is necessary for good sound
quality of that signal by itself. Another way could be to code additional transmission
channels, in this case for instance four, but this is typically a waste of bandwidth as well.
Therefore, according to an aspect of the invention, there is provided an encoder for
sum/difference coding of the compatible signal in case of a dominant center situation. In this
way, the center signal C falls out of one of the equations for the compatible signal, and that
equation can be used to calculate a fourth small signal. Of course, for a non-dominant
situation, everything can remain the same. For a dominant situation, a matrixing of the
compatible signal is added:

At the encoder side:
L0 = L + C + LS (22)
R0 = R + C + RS (23)
T3 = L (24)
T4 = LS (25)
T5 = RS (26)

Ch0 = L0 + R0 = L + R + 2C + LS + RS (27)
Ch1 = L0 - R0 = L - R + LS - RS (28)

At the decoder side:
L' = T3' (29)
LS' = T4' (30)
RS' = T5' (31)
R' = L' + LS' - RS' - Ch1' = T3' + T4' - T5' - Ch1' (32)
2C' = Ch0' - L' - R' - LS' - RS' = Ch0' + Ch1' - 2T3' - 2T4' (33)



Now R' can be obtained from small signals only, and C' from one strong signal
(Ch0') plus a number of small signals. The situation wherein strong signals are subtracted
from each other to obtain a small signal is avoided in this way. In the compatible stereo
decoder 4a, the following matrixing has to be performed:

L0 = (Ch0 + Ch1)/2 (34)
R0 = (Ch0 - Ch1)/2 (35)
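
The benefit in the dominant center situation can be illustrated numerically. The sketch below assumes, as a crude stand-in for coding noise, that one coded channel is off by a fixed fraction `rel` of its own amplitude; the sample values and function names are hypothetical. It compares R' obtained via equation (21) with R' obtained via equation (32).

```python
from fractions import Fraction


def r_conventional(L0d, R0d, T3d, T4d, T5d):
    """Equation (21): R' from two large compatible signals."""
    return R0d - L0d + T3d + T4d - T5d


def r_sum_difference(Ch1d, T3d, T4d, T5d):
    """Equation (32): R' from the small difference signal Ch1'."""
    return T3d + T4d - T5d - Ch1d


# Dominant center: C is far larger than the other inputs (arbitrary values).
L, R, C, LS, RS = 1, 2, 1000, 1, 1
L0, R0 = L + C + LS, R + C + RS
Ch1 = L0 - R0
rel = Fraction(1, 100)  # assumed 1 % amplitude error on one coded channel

# Conventional matrixing: the error sits on the large signal L0.
err_conventional = abs(r_conventional(L0 * (1 + rel), R0, L, LS, RS) - R)

# Sum/difference coding: the same relative error sits on the small signal Ch1.
err_sum_difference = abs(r_sum_difference(Ch1 * (1 + rel), L, LS, RS) - R)
```

With these values the error in R' is 10.02 in the conventional case against 0.01 with sum/difference coding, under the stated error model.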

Another situation where the invention finds application is when the compatible
signal (L0, R0) includes a matrixed surround signal, i.e. monophonic surround (S=f(LS+RS))
in the downmix, and when S is the strongest signal. This is referred to as the
"dominant surround" situation. In this situation, L0 is in amplitude almost equal to R0 but in
anti-phase. Selecting the left channel L, the right channel R and the centre signal C for
transmission in T3, T4 and T5 makes it impossible to retrieve LS and RS with an inverse
matrix. It can be shown that one of the small signals always needs to be retrieved by adding
L0' and R0'. The weakest of LS and RS should be selected as the third additional signal. This
is illustrated in an example below:

At the encoder side:
L0 = L + C - LS - RS (36)
R0 = R + C + LS + RS (37)
T3 = C (38)
T4 = L (39)
T5 = RS (40)

At the decoder side:
C' = T3' (41)
L' = T4' (42)
RS' = T5' (43)
LS' = L' + C' - L0' - RS' = T4' + T3' - L0' - T5' (44)
R' = R0' - C' - LS' - RS' = R0' - T3' - LS' - T5' (45)



Because L0' and R0' are in anti-phase, this means adding two large signals of
almost equal amplitude to obtain a small signal R'. It is clear that a relatively small error in L0'
or R0' will lead to a relatively large and clearly audible error in the resulting signal. The
quality can still be maintained, but only by coding at least one of the compatible signals with
a much higher bit-rate than necessary for good sound quality of that signal by itself. In
this case too, another way could be to code additional transmission channels, at the cost of
wasted bandwidth.

According to another preferred embodiment of the invention, a matrixing of
the compatible signal is added according to the following equations:



At the encoder side:
L0 = L + C - LS - RS (46)
R0 = R + C + LS + RS (47)
T3 = C (48)
T4 = L (49)
T5 = RS (50)

Ch0 = L0 + R0 = L + R + 2C (51)
Ch1 = L0 - R0 = L - R - 2LS - 2RS (52)

At the decoder side:
C' = T3' (53)
L' = T4' (54)
RS' = T5' (55)
R' = Ch0' - L' - 2C' = Ch0' - T4' - 2T3' (56)
2LS' = L' - R' - 2RS' - Ch1' = T4' - R' - 2T5' - Ch1' (57)



Now R' is obtained from only small signals, and LS' from one strong signal
(Ch1') plus a number of small signals. The situation in which two strong signals are
combined to obtain a small signal is avoided in this way. In the compatible stereo decoder,
the following matrixing has to be performed:

L0 = (Ch0 + Ch1)/2 (58)
R0 = (Ch0 - Ch1)/2 (59)
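
Equations (46)-(59) can be verified with a short round trip. The sketch below is illustrative only: the function names are hypothetical, gain factors are omitted as in the equations, and no coding error is modelled.

```python
def encode_dominant_surround(L, R, C, LS, RS):
    """Equations (46)-(52): returns Ch0, Ch1, T3, T4, T5."""
    L0 = L + C - LS - RS
    R0 = R + C + LS + RS
    return L0 + R0, L0 - R0, C, L, RS


def decode_dominant_surround(Ch0, Ch1, T3, T4, T5):
    """Equations (53)-(57): returns L', R', C', LS', RS'."""
    C, L, RS = T3, T4, T5
    R = Ch0 - L - 2 * C                # equation (56)
    LS = (L - R - 2 * RS - Ch1) / 2    # equation (57)
    return L, R, C, LS, RS


def stereo_from_sum_difference(Ch0, Ch1):
    """Equations (58)-(59): compatible stereo pair L0, R0."""
    return (Ch0 + Ch1) / 2, (Ch0 - Ch1) / 2
```

For example, the inputs L=1, R=2, C=3, LS=100, RS=90 (surround dominant) are recovered exactly by `decode_dominant_surround(*encode_dominant_surround(1, 2, 3, 100, 90))`.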



The invention finds application for instance in multi-channel music
distribution.

The coded data can be stored on a record carrier and subsequently read, decoded and
presented to a listener.

It should be noted that the above-mentioned embodiments illustrate rather than
limit the invention, and that those skilled in the art will be able to design many alternative
embodiments without departing from the scope of the appended claims. In the claims, any
reference signs placed between parentheses shall not be construed as limiting the claim. The
word 'comprising' does not exclude the presence of other elements or steps than those listed
in a claim. The invention can be implemented by means of hardware comprising several
distinct elements, and by means of a suitably programmed computer. In a device claim
enumerating several means, several of these means can be embodied by one and the same
item of hardware. The mere fact that certain measures are recited in mutually different
dependent claims does not indicate that a combination of these measures cannot be used to
advantage.



